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Abstract 


,  The  objective  of  this  research  is  to  derive  a  minimum  cost  sequence  of  auto¬ 
matic  built-in-tests  (BITs)  which  will  partition  modular  equipment  into  mutually 
exclusive  groups  of  modules.  Following  an  equipment  malfunction,  one  of  these 
groups  will  be  identified  by  the  BIT  diagnostic  subsystem  as  the  group  which  con¬ 
tains  the  faulty  module.  The  BITs  will  not  detect  all  of  the  possible  errors  in 
the  modules,  and  they  may  also  generate  false  alarms  by  calling  out  a  group  of 
modules  which  does  not  contain  the  faulty  unit.  Both  the  cost  of  a  BIT  and  the 
probability  that  a  BIT  will  pass  or  fail  are  functions  of  the  modules  which  are 
tested.  A  recursive  algorithm  is  developed  which  consists  of  a  backward  compu¬ 
tational  process  followed  by  a  forward  computational  process.  The  recursive  al¬ 
gorithm  generates  a  sequence  of  BITs  with  a  minimum--!  i  fn-ryr'H*  cost.  The  algo¬ 
rithm  is  applied  to  a  four-module  sample  problem  to  produce  a  numerical  solution. 
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INTRODUCTION 


This  is  the  second  and  final  report  summarizing  research  performed  under 
a  one  year  minigrant.  Grant  No.  AFOSR-78-3496,  awarded  by  the  Air  Force  Office  of 
Scientific  Research,  Air  Force  Systems  Command,  USAF.  It  is  a  continuation  of  the 
research  described  in  the  first  report,  "Specification  of  Built-in-Tests  for  Mod¬ 
ular  Equipment,"  (19),  mailed  on  May  2,  1978,  to  Mr.  Jerome  Klion,  RADC/RBRT, 
Griffis  AFB,  NY  13441,  and  to  Dr.  Joseph  Brain,  AFOSR/NM,  Bolling  AFB,  DC  20332. 

The  first  report  was  also  sent  on  July  25,  1978,  to  Mr.  1.  L.  Krulac,  RADC/RBRAC, 
Griffiss  AFB,  NY  13441.  General  background  information,  an  introductory  discus¬ 
sion  of  the  basic  problem,  definitions,  and  diagrams  are  contained  in  that  earlier 
report. 

In  modular  airborne  and  ground  electronic  equipment,  "built-in-test  (BIT) 
diagnostic  subsystems"  are  being  used  more  because  they  allow  fewer  and  less  quali¬ 
fied  maintenance  personnel,  and  fewer  pieces  of  external  test  equipment,  which  are 
generally  quite  expensive. 

A  primary  equipment  is  composed  of  modular  line  replaceable  units  (LRUs) , 
all  of  which  car  be  assumed  to  operate  independently.  The  LRUs  have  sufficiently 
low  probabilities  of  failure  so  that  the  probabilities  of  multiple  failures  can 
be  neglected.  Whenever  the  equipment  malfunctions,  a  single  LRU  is  assumed  to 
have  failed,  and  the  BIT  will  automatically  execute  a  fixed  sequence  of  diagnostic 
tests  to  identify  the  group  of  LRUs  which  contains  the  faulty  unit.  Secondary 
isolation  will  be  performed  by  semi-automatic  or  manual  means,  which  incur  time 
and  other  equipment  costs,  to  isolate  the  failed  LRU.  The  defective  unit  will 
be  removed  and  replaced,  and  the  system  retested. 

The  basic  objective  is  to  specify  a  sequence  of  automatic  BITs  that  will 
isolate  a  faulty  LRU  at  a  minimum  life  cycle  cost  (LCC) .  Unlike  the  first  report, 
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in  which  the  BITs  could  detect  all  of  the  equipment  errors,  the  BITs  in  this  re¬ 
port  are  assumed  to  be  imperfect  in  the  sense  that  they  will  not  detect  all  possi¬ 
ble  errors  in  the  LRUs  which  they  test.  Furthermore,  the  BITs  can  also  give  false 
alarms  by  erroneously  indicating  faults  in  LRUs  which  are  functioning  correctly. 

A  recursive  algorithm  is  developed  to  generate  a  minimum  cost  sequence  of 
automatic  BITs.  The  BIT  testing  sequence  partitions  the  equipment  into  mutually 
exclusive  groups  of  LRUs.  Following  an  equipment  malfunction,  one  of  these  mutual¬ 
ly  exclusive  groups  of  LRUs  will  be  identified  by  the  automatic  BIT  diagnostic  as 
the  group  which  contains  the  faulty  LRU.  The  recursive  algorithm  is  applied  to  a 
four-element  sample  problem  to  derive  a  minimum  cost  BIT  testing  sequence. 

The  Air  Force  can  impose  a  number  of  constraints  on  a  BIT  testing  sequence. 

In  this  report  the  following  three  representative  constraints  are  described. 

1.  The  BIT  subsystem  must  be  capable  of  detecting  a  minimum  proportion  of  equip¬ 
ment  faults.  2.  A  specified  percentage  of  the  repair  times  required  to  perform 
secondary  isolation  must  not  exceed  a  given  value.  3.  The  mean  time  to  repair 
the  equipment  must  not  exceed  a  specified  value. 
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NOTATION 


number  of  LRUs  in  the  equipment 

number  of  mutually  exclusive  groups  of  LRUs  into  which  the 
equipment  is  partitioned  by  its  BIT  diagnostic  subsystem 
built-in-test 
line  replaceable  unit 

prior  probability  of  failure  of  LRUi,  given  an  equipment 
malfunction 

probability  that  the  BIT  diagnostic  identifies  group  j  as 
the  group  which  contains  the  single  failed  LRU 
number  of  maintenance  manhours  required  to  perform  secondary 
isolation  of  group  j,  given  that  the  BIT  diagnostic  his 
identified  group  j  as  the  group  which  contains  the  faulty 


expected  number  of  maintenance  manhours  required  to  perform 
secondary  isolation 

expected  number  of  maintenance  manhours  required  to 
troubleshoot  the  entire  equipment  in  the  event  that  the  BIT 
diagnostic  does  not  recognize  that  a  failure  has  occurred 
mean  time  to  repair  the  equipment 

proportion  of  equipment  faults  detectable  by  the  BIT 
diagnostic 
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LCC  life  cycle  cost. 


M  value  of  maintenance  manhours  below  which  a  specified 

max 

percentage,  m2,  of  all  secondary  isolation  maintenance 
actions  should  be  completed;  percentile  of  the  distribu¬ 
tion  of  the  random  variable  MG.. 

1 

n  number  of  untested  LRUs. 

built-in-test  k.  A  test  is  represented  by  an  N-bit  number 
containing  only  the  bits  0  and  1.  A  0  is  assigned  in 
position  i  of  a  test  if  LRUi  must  be  good  in  order  for  the 
test  to  pass.  A  1  is  placed  in  position  i  of  a  test  if 
LRUi  is  not  tested. 

cost  associated  with  built-in-test  T^. 

S  state  of  the  equipment  prior  to  performing  the  test  T^. 

A  state  is  represented  by  an  N-bit  number  containing  only 
the  bits  0  and  1.  The  N  bits  in  the  designation  of  a  state 
correspond,  sequentially  from  left  to  right,  to  LRUI,  LRU2, 
...»  LRUN.  A  0  is  assigned  in  position  i  of  a  state  if 
LRUi  is  known  to  be  good.  A  1  is  assigned  in  position  i  of 
a  state  if  LRUi  is  not  yet  tested.  In  the  initial  state 
there  are  l's  in  all  positions  since  none  of  the  LRUs  have 
been  tested. 

Y  state  of  the  equipment  if  built-in-test  passes.  This 
state  is  computed  by  multiplying  S  and  T^  bit  by  bit  with 
no  carry. 

Z  state  of  the  equipment  if  built-in-test  T^  fails.  This  state 
is  computed  by  multiplying  S  and  T^,  the  complement  of  T^, 
bit  by  bit  without  carry. 


T  complement  of  built-in-test  T. 
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Vst> 


f(S,Tk) 


f(S) 


P(S) 
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a  terminal  state,  which  requires  secondary  isolation  to  find 
the  single  failed  unit. 

cost  of  secondary  isolation  of  terminal  state  St»  which 
corresponds  to  group  j  of  LRUs. 

a  terminal  state  produced  if  built-in-test  T^  passes, 
a  terminal  state  produced  if  built-in-test  fails, 
life  cycle  cost  of  a  sequence  of  built-in-tests,  given  that 
the  current  state  of  the  equipment  is  S  and  built- in-test 
is  performed. 

minimum  life  cycle  cost,  given  that  the  current  state  of 
the  equipment  is  S. 

probability  that  the  BIT  testing  sequence  has  identified 

state  S  as  the  state  which  contains  the  failed  LRU. 

life  cycle  cost  of  built-in-test  T^,  given  that  the  current 

state  of  the  equipment  is  S. 

,1,  if  Y  is  a  non- terminal  state 
l0,  if  Y  is  a  terminal  state 

,1,  if  Z  is  a  non-terminal  state 
l0,  if  Zt  is  a  terminal  state 


SAMPLE  PROBLEM 


A  four-element  sample  problem  with  seven  BITs  is  defined  in  Tables 
1  and  2.  For  each  BIT  the  following  information  is  provided  in  Table  1: 

1}  the  binary  designation  of  the  BIT,  2)  the  L,RU  under  test,  3)  the 
percentage  of  faults  in  the  LRU  under  test  detected  by  the  BIT,  4)  the 
probability  of  a  fault  in  the  LRU  under  test,  given  that  the  equipment  has 


malfunctioned. 

and  5) 

the  cost  of  implementing  the  BIT  for 

the  LRU 

under  test. 

Table  1 

BITs  for 

Individual  LRUs 

in  Sample  Problem 

Binary 
Designation 
of  Bit 

LRU 

under 

test 

Percentage 
of  faults 
detected 
by  BIT 

Conditional 
Probability 
of  fault 
in  LRU 
under  test 

Cost 
of  BIT 
for  LRU 
under  test 

0001 

1 

70 

0.2 

$  11 

• 

2 

85 

0.3 

6 

3 

65 

0.4 

10 

0010 

1 

90 

0.2 

5 

2 

80 

0.3 

13 

4 

75 

0.1 

8 

0011 

1 

82 

0.2 

11 

2 

93 

0.3 

4 

0100 

1 

76 

0.2 

12 

3 

67 

0.4 

15 

4 

84 

0.1 

7 

0101 

X 

87 

0.2 

14 

3 

72 

0.4 

9 

0110 

1 

68 

0.2 

12 

4 

81 

0.1 

8 

0111 

1 

74 

0.2 

10 

The  times  (in  hours)  and  the  costs  (in  dollars)  required  for  secondary 


isolation  of  all  potential  terminal  states,  each  identified  by  an  index  j, 
are  given  in  Table  2.  The  cost  of  secondary  isolation  is  equal  to  the  time 
required  for  secondary  isolation  multiplied  by  a  labor  rate  of  $10  per 
hour. 

Table  2 

Costs  of  Secondary  Isolation  for  Sample  Problem 


Potential 

terminal 


m\ 


>up  j 

LRUs 

state,  St 

(hour) 

Vst> 

1 

1 

1000 

0.3 

$  3 

2 

2 

0100 

0.7 

7 

3 

3 

0010 

0.5 

5 

4 

4 

0001 

0.4 

4 

5 

1,2 

1100 

2 

20 

6 

1,3 

1010 

2 

20 

7 

1,4 

1001 

2 

20 

8 

2,3 

0110 

2 

20 

9 

2,4 

0101 

2 

20 

10 

3,4 

0011 

2 

20 

11 

1,2,3 

1110 

3  ' 

30 

12 

1,2,4 

1101 

3 

30 

13 

1,3,4 

1011 

3 

30 

14 

2,3,4 

0111 ' 

3 

30 

15 

1,2, 3, 4 

1111 

4 

40 

COMPUTATION  OF  STATE-DEPENDENT  PROBABILITIES 
THAT  A  SINGLE  BIT  WILL  PASS  OR  FAIL 


Both  the  cost  of  a  BIT  and  the  probability  that  a  BIT  will  pass  or 
fail  are  state-dependent  because  they  are  both  functions  of  which  LRUs  are 
tested.  For  example,  the  BIT,  will  test  LRUs  1,  2,  and  3  in  state 

1111.  However,  in  state  0110,  since  LRU1  is  known  to  be  good,  T  will 
test  only  LRUs  2  and  3.  In  state  0010  only  LRU3  will  be  tested  by 
The  state-dependent  cost  of  a  BIT  is  simply  the  sum  of  the  costs  of 
implementing  the  BIT  for  each  of  the  LRUs  under  test  in  the  given  state. 
Referring  to  the  costs  in  Table  1,  the  cost  of  Tqqq^  in  state  0110  is 
$6  +  10  =  $16  because  LRUs  2  and  3  are  tested.  For  the  sample  problem 
specified  in  Tables  1  and  2,  the  costs  of  each  BIT  with  different  combina¬ 
tions  of  LRUs  under  test  are  summarized  in  Table  3. 

The  cost  of  a  BIT  is  not  the  only  function  which  is  state-dependent. 
The  probability  that  a  single  BIT  will  pass  or  fail  is  also  dependent  on 


which  LRUs  are  tested.  These  probabilities  are  listed  in  Table  3  for  the 
BITs  in  the  sample  problem.  The  computations  of  these  state-dependent 
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probabilities  in  Table  3  are  illustrated  by  the  following  representative 
calculations  for  the  single  BIT,  Tqqq^,  which  can  test  any  combination  of 
LRUs  1,  2,  and  3. 

1.  First  assume  that  LRUs  1  and  2  are  known  to  be  good.  Only 
LRU1  is  to  be  tested. 


P(T0ooi  fail JlRUI  tested)  =  P(LRU1  bad)P(T00(n  detect  fault] 
LRU1  bad) 

=  0.2(0. 7)  =  0.14 
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p(T0001  PaS3 1 LRU1  tested)  =  P(LRU1  good)  + 

P(LRU1  bad)p(T0001  not  detect  fault|LRUl  bad)  (2) 

0.8  +  0.2(0. 3)  =  0.86  =  ^~p(^0001  fafl|bR01  tested) 

Next  assume  that  only  LRU3  is  known  to  be  good.  LRUs  1  and 
2  are  to  be  tested. 

P^0001  ^aff lbR01» 2  tested)  =  P(LRU1  bad)p(PQQQ2  detect  faultj 
LRU1  bad)  +  P(LRU2  bad)p(T0ooi  detect  fault |LRU2  bad) 

-  P(LRU1  bad)p(T00Q1  detect  fault |LRU1  bad)P(LRU2  bad) 
p(T000i  det&ct  fault |LRU2  bad)  (3) 

=  0.2(0. 7)  +  0.3(0.85)  -  0.2(0.7)0.3(0.85)  =  .359 


P(T0001  PasslLRU1»2  tested)  =  P(LRU1  good)P(LRU2  good) 

+  P(LRU1  bad>p(T0001  not  detect  fault [LRU1  bad)P(LRU2  good) 

+  P(LRU1  good)P(LRU2  bad)P(T0001  not  detect  fault |LRU2  bad) 

+  P(LRU1  bad)p(T00oi  not  detect  fault |LRU1  bad)P(LRU2  bad) 
p(Tqooi  noC  detect  fault  |l.RU2  bad)  (4) 

*  0.8(0. 7)  +  0.2(0. 3)0. 7  +  0.8(0.3)0.15  + 

0.2(0.3)0.3(0.15)  =  0.641 
»  1_P<T0001  failiLRU1>2  tested) 

3.  Finally  assume  that  LRUs  1,  2,  and  3  are  to  be  tested. 

P(T0001faillLRU1’2,3  tested>  =  p(LRUl  bad)P(T0001  detect  fault | 
LRU1  bad) 

+  P(LRU2  bad)P(TpQQ^  detect  fault |LRU2  bad) 

+  P(LRU3  bad)P(Tc001  detect  fault|LRU3  bad) 

-  P(LRU1  bad) P(TQ001  detect  fault |LRU1  bad)P(LRU2  bad) 

P^T0001  detect  fault |LRU2  bad) 


-P(LRU1  bad)P(TQ001  detect  faultjLRUl  bad)P(LRU3  bad) 

PCTqoqi  detect  fault |LRU3  bad) 

-P(LRU2  bad>p(T0001  detect  fault |LRU2  bad)P(LRU3  bad) 

P(Tq^oi  detect  fault |LRU3  bad) 

+P(LRU1  bad)p(*0001  detect  faultjLRUl  bad)F(LRU2  bad) 
p(TOooi  detect  fault|LRU2  bad)P(LRU3  bad) 

f 

p(Tqooi  detect  faultjLRU3  bad)  (5) 

=  0.2(0. 7)  +  0.3(0.85)  +  0.4(0.65)  -  0.2(0.7)0.3(0.85) 

-  0.2(0.7)0.4(0.65)  -  0.3(0.85)0.4(0.65) 

+  0.2(0.7)0.3(0.85)0.4(0.65)  =  0.526 

P(T0001  PassiLRU1»2»3  tested)  *  P(LRU1  good)P(LRU2  good)P(LRU3  goo, 
+  P(LRU1  bad)p(T0001  not  detect  fault |LRU1  bad)P(LRU2  good)P(LRU3 
+  P(LRU1  bad)P(T0001  not  detect  faultjLRUl  bad)P(LRU2  bad) 
p(T0ooi  not  detect  fault jLRU2  bad)P(LRU3  good) 

+  P ( LRU1  bad)P(T0()01  not  detect  fault|LRUl  bad)P(LRU2  good)P(LRU3  \ 
P(Tqooi  not  detect  fault [LRU3  bad) 

+  P(LRU1  good)P(LRU2  bad)P(TQ001  not  detect  fault|LRU2  bad)P(LRU3 
+  P(LRU1  good) P(LRU2  bad)P(T0001  not  detect  fault jLRU2  bad)P(LRU3  ’ 
P(Tqooi  not  detect  fault|LRU3  bad) 

+  P(LRU1  bad)P(T0001  not  detect  faultjLRUl  bad)P(LRU2  bad) 

P(Tq00i  not  detect  fault|l.RU2  bad)P(LRU3  bad) 

P(Tqqoi  not  detect  fault |LRU3  bad) 

+  F(LRU1  good)P(LRU2  good)P(LRU3  bad)P(T00Q1  not  detect  fault] 

LRU 3  bad)  (6) 

=  0.8(0. 7)0. 6  +  0.2(0. 3)0. 7(0.6)  +  0.2(0.3)0.3(0.15)0.6 
+  0.2(0.3)0.7(0.4)0.35  +  0.8(0.3)0.15(0.6) 

*  0.8(0.3)0.15(0.4)0.35  +  0.2(0.3)0.3(0.15)0.4(0.35) 


.  — .A  \  .• 


+  0.8(0.7)0.4(0.35)  =  0.474 
=  1  -  P(Tnnni  fail jLRUl ,2,3  tested) 


Using  formulas  analogous  to  those  developed  for  Tqqq^>  the  values  in 
Table  3  are  computed  for  the  probabilities  that  *11  BITs  will  pass  or 
fail  when  one,  two,  or  three  LRUs  are  tested.  When  specified  LRUs  are 
tested  by  a  BIT,  the  probability  that  the  BIT  will  pass  and  the  probability 
that  the  BIT  will  fail  sum  to  one.  This  result,  which  can  be  proven 
algebraically,  is  apparent  is  each  row  of  Table  3. 


COMPUTATION  OF  STATE  PROBABILITIES 
IN  A  BIT  TESTING  DIAGRAM 


In  a  BIT  testing  diagram  a  node  represents  a  state  of  th2  equipment. 

To  compute  the  life  cycle  cost  of  a  BIT  testing  diagram,  the  probabilities 
of  terminal  states  must  be  calculated.  The  probabilities  of  terminal  states 
are  the  product  of  the  branch  probabilities  which  produce  these  states. 
Branch  probabilities  are  conditioned  on  the  current  state  which  is  tested. 
Upward  sloping  branches  contain  the  probability  that  the  test  on  the  current 
state  passes;  downward  sloping  branches  give  the  probability  that  the  test 
on  the  current  state  fails.  The  probability  of  a  designated  state  is  Lhe 
product  of  the  probabilities  on  all  of  the  upward  sloping  and  downward 
sloping  branches  along  the  path  leading  frrra  the  initial  state  to  the 
designated  state.  More  concisely,  the  probability  of  a  designated  state 
is  equal  to  the  probability  of  the  preceding  state  multiplied  by  the 
probability  of  the  outcome  of  the  test  which  produces  the  designated  state. 


s*\*  'V  %’  *. 

Ta  V  *  *  V  -v  /  .  »•. 


Table  3 

State-dependent  Probabilities  and  Costs 
for  BITs  in  Sample  Problem 


BIT 

LRUs 

under 

P(Tk  pass | LRUs 

P(Tk  fail | LRUs 

"k 

test 

under  test) 

under  test) 

0001 

1 

0.86 

0.14 

2 

0.745 

0.255 

3 

0.74 

0.26 

1,2 

0.641 

0.359 

1,3 

0.636 

0.364 

2,3 

0.551 

0.449 

•• 

1,2,3 

0.474 

0.526 

0010 

1 

0.82 

0.18 

2 

0.76 

0.24 

4 

0.925 

0.075 

1,2 

0.623 

0.377 

1,4 

0.759 

0.241 

2,4 

0.703 

0.297 

1,2,4 

0.576 

0.424 

0011 

1 

0.836 

0.164 

2 

0.721 

0.279 

1,2 

0.603 

0.397 

0100 

1 

0.848 

0.152 

3 

0.732 

0.268 

4 

0.916 

0.084 

1,3 

0.621 

0.379 

1,4 

0.777 

0.223 

3,4 

0.671 

0.529 

1,3,4 

0.569 

0.431 

0101 

1 

0.826 

0.174 

3 

0.712 

0.288 

1,3 

0.588 

0.412 

0110 

1 

0.864 

0.136 

4 

0.919 

0.081 

1,4 

0.794 

0.206 

0111 

1 

0.852 

0.148 

Cost  of  BIT 
for  LRUs 
under  test 
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In  symbolic  terms,  if  S  denotes  the  current  state  and  T,  is  the  test 
performed,  V  is  the  next  stete  If  Tfe  passes,  and  2  Is  the  next  state  if 
Tk  fails,  the  following  pair  of  equations  will  give  the  probability  of  the 


.  next  state: 


P(Y)  =  P(S)P(Tk  passes |S) 
P(Z)  =  P(S)P(Tk  fails |S) 


To  illustrate  the  computation  of  state  probabilities,  consider  the  BI1 
testing  sequence  <T0()0J ,  T^,  T^J.  ihe  corresponding  testing 
diagram  appears  in  Figure  1. 


Figure  1.  BIT  Testing  Diagram 


The  conditional  branch  probabilities  that  the  3ITs  will  pass  or 
fail  have  already  been  computed  in  Table  3. 


p(T000i  fail | 1111)  =  P(T0001  fail [ LRUl, 2, 3  tested)  =  0.526 
P^T0001  Passlmi)  =  P^T0001  pass jLRUl, 2, 3  tested)  =  0.474 
p(T0ioi  fail jlllO)  =  P(TQ101  fail |LRU1, 3  tested)  =  0.412 
p(T0ioi  pass  1 1110)  =  P(T0101  pass | LRUl, 3  tested)  =  0.588 
P(T0010  faill101°)  "  p(T00io  failjLRUl  tested)  =  0.18 
p(T00i0  Pass | 1010)  =  p(T0010  passlLRU1  tested)  =  0.82 


These  branch  probabilities  are  placed  on  the  BIT  testing  diagram  in 
Figure  2. 


Figure  2.  BIT  Testing  Diagram 

with  Branch  probabilities 


The  state  probabilities  for  the  testing  diagram  are  computed  below. 
p(llll)  =  1 

p(0001)  =  p(llll)P(T0001  pass  j 1111)  =  1(0.474)  =  0.474 
p(1110)  =  p(llll)p(T0001  fail | 1111)  =  1(0.526)  =  0.526 


P(0100)  = 


P(IOIO)  = 


P(OOIO)  = 


P(IOOO)  = 


P(1110)P(T0101  pass  1 1110) 
P(1110)P(T0101  fail | 1110) 
P(1010)P(T0010  pass | 1010) 
P(1010)P(T0010  fail | 1010) 


=  0.526(0.588)  =  0.309 


=  0.526(0.412)  =  0.217 


=  0.217(0.82)  =  0.178 


=  0.217(0.18)  =  0.039 


These  state  probabilities  are  placed  on  the  testing  diagram  in  Figure  3. 


P(0001)=0.474 


P(0100)=0.309 


J  Test 

LRU1, 2, 3  NK 


P(0010) =0.178 


P(llll)=l 


P(1000) =0.039 


Figure  3. 


BIT  Testing  Diagram 
with  Branch  Probabilities 
and  State  Probabilities 


The  probabilities  of  the  terminal  states  in  the  testing  diagram 


shown  in  Figure  3  are: 


P1  =  P(1000)  =  0.039 


?2  =  P(0100)  =  0.309 


P3  =  P(0010)  =  0.178 
P4  =  P(0001)  =  0.474 


Since  the  four  terminal  states  in  this  testing  diagram  are  mutually  exclu¬ 


sive  and  exhaustive, 


4 

pj 

J=1  J 


1. 


LIFE  CYCLE  COST  OF  A  BIT  TESTING  SEQUENCE 

The  objective  of  this  research  is  to  generate  a  sequence  of  BITs  with 
a  minimum  life  cycle  cost.  The  life  cycle  cost  of  a  BIT  testing  sequence 
is  equal  to  the  sum  of  the  costs  of  the  BITs  plus  the  expected  cost  of  sec¬ 
ondary  isolation  for  the  mutually  exclusive  groups  of  LRUs  that  comprise  the 
terminal  states  which  can  be  called  out  by  the  BIT  diagnostic.  In  equation 
form 


M 

LCC  =  I  C  +  E[I.(S  ) ]  (8a) 

k=l  K  1 

M  K 

=  EC.  +  I  P.I.(SJ  (8b) 

k=1  *  j=1  3  0  t 

where  M  <_  2^  ^-1 

Equation  (8)  will  be  used  to  compute  the  life  cycle  cost  of  the  BIT 
testing  diagram  given  in  Figure  3.  That  diagram  reappears  in  Figure  4. with 
the  state-dependent  costs  of  the  BITs,  taken  from  Table  3,  added  to  the 
diagram.  Also  added  to  the  testing  diagram  in  Figure  4  are  the  costs  of 
secondary  isolation  for  the  terminal  states,  given  in  Table  2.  State 


probabilities 


appear  in  parentheses. 


v  .474) 


Using  «, nation  (8b)  to  compute  the  life  cycle  cost  of  the  BIT  testing 
diagram  in  Figure  4, 

3  4 

LCC  =  l  C  +  EP.I.(S) 

k=l  fc  j=1  J  J  t 

fcoooijnn  +  coioi[mo  +  cooio|ioioJ 

[Pj^ij^dooo)  +  p2i2(oioo)  +  p3i3(ooio)  +  p^,  i^(oooi)  j  ( 

I$27+$23+$5]  +  [°'039($3)-H).309($7)+0.i78($5)-K).474($4)] 


RECURSIVE  EQUATION  FOR  CALCUI.ATING  THE  LIFE  CYCLE  COST 
OF  A  BIT  TESTING  DIAGRAM 


A  recursive  equation  can  be  formulated  to  calculate  the  life  cycle 

cost  of  a  BIT  testing  diagram.  Let  f(S,Tk>  be  the  life  cycle  cost  of  a 

sequence  of  BITs,  given  that  the  current  state  is  S  and  BIT  T  is 

k 

performed.  Initially  assume  that  test  TR  produces  two  terminal  states, 

symbolized  by  Yfc  and  Z fc.  In  this  case,  following  the  logic  of  equations 
(8b)  and  (7), 


f(S’V  -  CTk|s 


j=l  3  2  1 


(10) 


where 


P(Yt)  -  P(S)P(Tk  pass|s)  if  Tfc  passes 
Pj  “  {P(Zt)  =  P(S)P(Tk  fail | S)  if  Tfc  fails 

and 


(lla) 

(llb) 


VV  • { 


YV  if  Tk 
YV  if  \ 


passes 

fails 


Hence 


(12a) 

(12b) 


f(S,Tfc) 


=  CTk|s  +p(V1j(V  +P(Zt)1j(Zt) 

=  CTk|s  +  P(S>tP(Tk  pass|s)Ij(Yt)+p(Tk  fail|s)I  (Zfc) ] 


(13) 

(14) 


A  recursive  formulation  for  f(S,T^)  expressed  as  a  function  of  f(Y) 
or  f(Z)  is  necessary  whenever  does  not  produce  two  terminal  states. 
Suppose  that  a  single  terminal  state,  Y  ,  is  produced  only  when  T  passes. 
However,  when  fails  assume  that  the  resulting  state,  Z,  is  not  a 
terminal  state.  Then 


f(S,Tk)  =  CT  |S  +  P(S)P(Tk  pass|s)Ij(Yt)  +  f(Z) 

Similarly,  if  a  terminal  state,  Z  ,  is  produced  only  when  T  fails, 

v  IV 


(16) 


f(S,Tk)  *  CT  jg  +  f (Y)  +  P(S)P(Tk  fail|s)Ij(Zt) 


(17) 


Finally,  if  both  Y  and  Z  are  non-terminal  states. 


f(S,Tk)  =  CT  |S  +  f (Y)  +  f (Z) 


(13) 


Equations  (14),  (16),  (17)  and  (18)  can  be  combined  in  a  single  equation 
(19)  as 

f(S,Tk)  =  CT  js  +  y  f(Y)  +  (l-y)P(S)P(Tk  passes |s) I  (Yfc) 
k'  J 

+  z  f(Z)  +  (l-z)P(S)P(Tk  fails |s)Ij(Zt)  (19) 

where 


1, 

if 

Y  is  a 

non-terminal  state 

0, 

if 

Yfc  is  a 

terminal  state 

1, 

if 

Z  is  a 

non-terminal  state 

0, 

if 

Zfc  is  a 

terminal  state 

The  recursive  equations  (19)  and  (15)  will  be  used  to  evaluate  the 


life  cycle  cost  of  the  BIT  testing  diagram  in  Figure  4. 


f (1010)  -  C0010|1010  +  P(1010)[P(T0010  pass | 1010) 1^(0010) 

+  P(T0010  faill1010)Ij(1000)i  (2°) 

=  $5  +  .217[.82(5)  +  .18(3)]  =  $6,007 
f (1110)  =  C0101jm0  +  P(1110)P(T0101  pass  1 1110)  1^0100)  +  f(1010)  (21) 
*  $23  +  .526(. 588)7  +  6.007  =  $31,172 
f(llll)  =  C0001|nll  +  P(1111)P(T0001  pass  1 1111) I.  (0001)  +  f(1110)  (22) 
-  $27  +  1(. 474)4  +  31.172  =  $60,068 
which  agrees  with  the  result  obtained  with  equation  (9). 

RECURSIVE  ALGORITHM  FOR  GENERATING  A  LEAST  COST  SEQUENCE 

OF  BITS 

Equations  (19)  and  (15)  can  be  used  together  to  implement  a  recur¬ 
sive  algorithm  for  generating  a  least  cost  sequence  of  BITS.  The  recur¬ 
sive  algorithm  consists  of  a  backward  solution  procedure  followed  by  a 
forward  computational  process.  A  testing  sequence  car  be  divided  into 
stages,  such  that  each  stage  represents  the  number  of  untested  elements. 

At  each  state  of  every  stage  a  set  of  possible  decisions  consists  of  all 
of  the  BITs  which  can  be  performed  plus  the  alternative  of  performing 
secondary  isolation  of  that  state.  The  solution  procedure  begins  by  moving 
backward  stage  by  stage.  The  backward  solution  procedure  starts  at  stage 
one  in  which  each  state  contains  a  single  failed  LRU.  For  each  state  S 


V 


rJt 


of  every  stage  evaluated  during  the  backward  solution  procedure,  no 
numerical  value  is  known  for  P(S)  because  the  numerical  value  of  P(S) 
is  dependent  on  a  unique  BIT  testing  diagram.  However,  the  desired  BIT 
testing  sequence  is  determined  only  after  completion  of  the  entire  re¬ 
cursive  algorithm.  Therefore,  f(S,TR)  must  be  expressed  as  a  function  of 
the  unknown  variable  P(S)  during  the  backward  recursive  procedure. 

The  evaluation  of  f(S)  during  the  backward  recursive  algorithm  is 
facilitated  by  the  fact  that  f(S,Tk>  can  be  expressed  as  a  linear  function 
of  the  unknown  variable  P(S).  To  understand  why  f(S,T.)  is  a  linear 
function  of  P(S),  observe  that  every  backward  solution  begins  by  con¬ 
sidering  potential  terminal  states  at  stage  one.  Consider  equation  (14) 
in  which  both  Yfc  and  are  terminal  states. 

f(S’V  =  CT,  Is  +  P(S)lP(Tk  pass|s)l  (Yt)  +  P(Tk  fail |s)I . (Z  ) ]  (14) 

Except  for  P(S),  the  numerical  values  of  all  of  the  terms  on  the  right 
hand  side  of  equation  (14)  are  known.  For  example,  in  the  four-element 
sample  problem  the  numerical  values  of  CT  jg,  of  P(Tk  pass|S),  and  of 
p(Tk  fail|s)  can  be  found  in  Table  3.  The  numerical  values  of  both 
VV  and  Ij(Zfc)  are  specified  in  Table  2.  After  appropriate  numerical 
values  have  been  substituted  in  equation  (14),  f(S,Tk)  can  be  represented 
as  a  linear  function  of  P(S),  as  shown  in  equation  (23). 

,(S’V  *ak  +  bkp<s>-  (23) 

where  afc  and  bk  are  constants. 

For  all  states  S,  0  <_  P(S)  <_  1  because  P(S)  is  a  probability. 

Therefore,  it  will  generally  be  possible  to  find  a  unique  value  for  f(S), 


expressed  either  numerically  as  the  value  of  1 t ( S)  which  is  known  from 
Table  2,  or  expressed  instead  as  a  linear  function  of  the  unknown  state 
probability  P(S) .  In  the  latter  case 

f(S)  -  min  {f(S,Tk),  I^S)}  =  a^  +  bR*  P(S), 

where  the  asterisk  denotes  the  value  of  k  which  minimizes  the  function 
f(S:Tk). 

The  backward  recursive  process  ends  at  stage  N  in  which  there  is  a 
single  initial  state  consisting  of  all  l's  since  no  LRUs  have  been  tested. 

The  probability  of  the  initial  state  is  equal  to  1  because  every  testing 
diagram  must  start  with  all  of  the  LRUs  untested.  Since  the  probability 
of  the  initial  state  is  known,  all  other  state  probabilities  can  be  obtained 
by  beginning  at  the  starting  state  and  successively  applying  equation  (7) 
to  calculate  the  probabilities  of  states  at  the  lower  numbered  stages. 

By  executing  these  forward  computations  of  state  probabilities,  an  optimal 
testing  sequence  can  be  constructed  along  with  the  partition  into  mutually 
exclusive  terminal  states  which  this  testing  sequence  produces. 

To  calculate  the  state  probabilities  and  identify  an  optimal  testing 
sequence,  which  has  its  origin  in  the  starting  state,  equations  (7)  are 
used  in  a  forward  computational  process.  In  the  initial  state  S  -  11  .  .  .  1 
and  P(ll  .  .  .  1)  =  1.  Following  the  performance  of  an  optimal  test, 

Tk*,  on  the  initial  state,  the  two  successor  states,  Y  and  Z,  can  be 
determined  by  using  equation  (7). 

P(Y)  =  P(ll  .  .  .  l)P(Tk*  pass 1 11  ...  1)  (7a) 

l)P(Tk*  fail | 11  ...  1) 


P(Z)  »  P(ll 


(7b) 


States  Y  and  Z  are  nodes  at  the  ends  of  the  first  upward  sloping  branch  and 
the  first  downward  sloping  branch  of  an  optimal  testing  diagram  which 
originates  from  a  starting  state  consisting  of  all  l’s.  Equation  (7)  is 
next  applied  to  states  Y  and  Z  by  performing  the  tests  which  are  optimal 
in  each  of  those  states.  This  forward  compx tational  procedure  is  repeated 
at  each  new  non-terminal  successor  state  until  an  entire  minimum  cost 
testing  diagram  and  the  terminal  states  which  it  produces  have  been 
identified. 

SOLUTION  OF  EXAMPLE  BY  RECURSIVE  ALGORITHM 

The  complete  recursive  algorithm  is  demonstrated  by  applying  it  to 
the  four-element  sample  problem  specified  in  Tables  1  and  2.  The  backward 
recursive  computations  are  summarized  in  Tables  4,  5  and  6.  Figures  5,  6 
and  7  are  used  to  illustrate  backward  recursive  calculations  for  repre- 
su.._ativa  entries  in  Tables  4,  5  and  6. 

Table  4 

Values  of  Terminal  States  at  Stage  One 


Key  to  entries  in  a  typical  cell 
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Key  to  entries  in  a  typical  cell 


«vr. 


o\\‘  o  v*  -.•  •.'•  •/  -.*  *.  ■/  %w  \\v- 


y  =  0 


n  *  2 


S  =  0110 


T.  =  0101 
k 


0100  1  1(0100)  -  $7 


0010  '  i(onio)  =  $5 


Figure  5.  Testing  Diagram  for  f(0110,0101) 


Calculations  are  indicated  for  f(0110,  0101)  and  f(0110)  in  Table  5. 


To  evaluate  f(0110,  0101)  as  a  linear  function  of  the  unknown  state  prob¬ 


ability  P(0110),  equation  (14)  is  used  because  both  Yfc  =  0100  and  Z t  =  0010 


are  terminal  states. 


f (0110,0101)  =  C0101j0110  +  P(0110[P(T0101  pass |0110)1(0100) 


+  P(T  fail|0110)I(0010) ] 


=  9  +  P(0110) [0.712(7)  +  0.288(5)] 


=  9  +  6.424P(0110) 


Equation  (15)  is  used  to  find  a  unique  value  for  f(01i0),  in  this  case 


expressed  as  a  linear  function  of  P(0110). 


f(S)  =  min{f  (S,  T,  ),  1 .  (S) } 


f(0110)  ='>nin{f(0110,  0010),  f(0110,  0011),  f(0110,  0100), 


f(0110,  0101),  1(0110)) 


WTO 


wwwrwvwi? 


=  min{l3  +  5.48P(0110),  4  +  5.558P(0110) ,  15  +  6.464P(01i0), 
9  +  6.424P(0110),  20} 

-  4  +  5.558P(0110)  @  Tk*  «  T00n 


1(0001)  -  $4 


f(1100)  -  10 
+  6.408P(1100) 

Figure  6.  Testing  Diagram  for  f (1101, 0011) 

Calculations  for  f(l!01,  0011)  and  f(1101)  in  Table  6  are  illustrated 
below.  Since  =  0001  is  a  terminal  state  while  Z  =  1100  is  not  a  tereii  al 
state,  equation  (16)  is  used  to  evaluate  f(1101,  0011). 

f(1101,  0011)  =  C0011|1101  +  P(1101)P(T0011  pass |ll01)I(0001) 

+  f(1100)  (16) 

From  Table  5, 

f(1100)  =  10  +6.408P(1100) 

Using  equation  (7b)  and  Table  3, 

P(1100)  =  P(1301)P(T0f)11  fail  1 1301)  =  0.397P(1101)  (7b) 


•  .  •  A\V.  ,*/.'  .*  ^ 


Substituting  the  value  of  P(1100)  gives 


fUiOO)  =  10  +  6. 408(0. 337) P (1101) 


f (1101,  0011)  =  15  +  0.603(4)P(1101)  +  [10  +  6. 408(0. 397)P(1101) ] 


=  25  +  4.956P(1101) 


Equation  (15)  is  used  to  find  f(il01)  as  a  linear  function  of  P(1101). 

f(1101)  =  min{f (1101,  0001),  f(1101,  0011),  f(1101,  0100),  f(1101,  0101), 
f(H01,  0110),  f(1101,  0111),  1(1101)) 

=  min{27  +  4.864P(1101) ,  25  +  4.956P(1101) ,  29  +  6. 298P(1101) , 

18  +  4.517P(1101),  30  +  6. 352P(1101) ,  14  +  4 .565P(1101) ,  30} 

=  14  +  4.565P(1101)  at  T  *  =  T_- 

K  Ulll 


n  =  4 


S  =  1111 


T.  =  0110 
k 


y  -  i 


0110  J  f(0110)  = 

4+5.558P(0110) 


f(1001)  = 

10+3. 852P(1001) 


Figure  7.  Testing  Diagram  for  f(llll,0110) 
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Calculations  are  given  for  f (1111,  0110)  and  f (1111)  in  Table  6. 

Since  state  S  =  1111  is  the  initial  state  for  the  testing  diagram, 

P(llll)  =  1.  Knowledge  of  the  starting  state  probability  enables  numerical 
values  to  be  calculated  for  f(llll,  0110)  and  f(llll).  Furthermore,  by 
moving  forward  from  the  starting  state  and  repeatedly  applying  equations  (7), 
the  other  state  probabilities  can  be  computed  and  thus  the  optimal  testing 
diagram  can  be  determined.  Both  Y  =  0110  and  Z  =  1001  are  non- terminal 
states.  Therefore,  equation  (18)  will  be  used  to  compute  a  numerical 
value  for  f(llll,  0110). 


f(im,  ono)  =  c0110|im  +  f (0110)  +  f (1001) 


From  Table  5, 


f(0110)  -  4  +  5. 558P(OJ 10) 


f(1001)  =  10  +  3.852P(1001) 


Using  equations  (7)  and  TabTe  3, 


P(0110)  =  P(illl)P(T0110  pass lllll)  =  1(0.794) 
P(1001)  =  P(lill)P(Tn11n  fail |llll)  =  1(0.206) 


Substituting  the  values  of  P(0110)  and  of  P(1001)  in  the  equations  for 
f(0110)  and  f(1003),  respectively,  gives 


f(llll,  0110)  =  20  +  [4  +  3.353(0.794)]  +  [10  +  3.852(0.206)]  =  39.207 


*.  V**  \  i*.  * 


Equation  (15)  is  used  to  find  a  numerical  value  for  f (1111)  which  is  the 
cost  of  the  minimum  cost  testing  diagram. 

f(llll)  =  rain{f(llll,  0001),  f(llll,  0010),  f(llll,  0011),  f(llll,  0100), 
f(llll,  0101),  f(llll,  0110),  f(llll,  0111),  1(1111)} 

=  min{45.620,  44.816,  38.510,  57.981,  36.756,  39.207, 

27.072,  40} 

*  27.072  at  T.  *  =  0111 
k 

•  Forward  calculations  of  state  probabilities  are  accomplished  by 
successive  applications  of  equation  (7).  States,  state  probabilities  and 
costs  in  the  optimum  solution  for  the  sample  problem  are  summarized  in 


Table  7 
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Table  7 


States,  Tests,  and  Costs  in  the  Optimum  Solution 
for  the  Sample  Problem 


s 

Previous  Present 

State  State 

P(S) 

V 

test 

f(S)  Y 

Next  state  if 
cost  Tk*  passes 

Z 

Next  state 

T*  fails 
k 

None 

1111 

1 

T 

0111 

$27,072 

0111 

1000 

1111 

0111 

.852 

T 

0110 

12+5.432P(0111) 

=$16,628 

0110 

0001 

1111 

1000 

.148 

Ij(lOOO) 

3  P(1000) 

=$0,444 

none 

none 

0111 

0110 

.783 

T0011 

4+5. 558P(0110) 
=$8,352 

0010 

0100 

0111 

0001 

.069 

Ij (0001) 

4P(0001) 

=$0,276 

none 

none 

0110 

0010 

.565 

I..  (0010) 

5P(0010) 

=$2,825 

none 

none 

0110 

0100 

.218 

Ij (0100) 

7P(0100) 

=$1,526 

none 

none 

The  optimum  solution 

for  the 

sample  problem  is  the  BIT 

testing 

sequence 

{T0111* 

T0110’  T0011 

}  with  a 

life  cycle  cost  of  f(llll) 

=  $27,072.  The 

minimum 

cost  testing  diagram 

is  given 

in  Figure  8. 
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fCllll)  =  $27,072 


(.783) 


0010  I  $5 


(.565) 


0100  1  $7 


(.852) 


(.218) 


(.069) 


(.148) 


Figure  8.  Optimum  Testing  Sequence  for  Sample  Problem 


When  the  life  cycle  cost  of  the  optimum  testing  sequence  is  computed 
directly  from  the  testing  diagram  in  Figure  8  by  using  equation  (8b), 


C=  (10+8+4)  +  0.148(3)  +  0.069(4)  +  0.565(5)  +  0.218(7)  =  $27,071,  (24) 


the  result  agrees  with  the  life  cycle  cost  derived  by  applying  the  recur¬ 
sive  algorithm. 


CONSTRAINTS 


A  minimum  cost  solution  consisting  of  a  sequence  of  BITs  and  the  mutual¬ 
ly  exclusive  terminal  states  called  out  by  these  BITs  may  be  required  to  satisfy 


constraints  imposed  by  the  Air  Force.  In  this  report  the  following  three 
constraints  are  described,  although  other  constraints  could  have  been  chosen. 

1.  A  minimum  proportion  of  equipment  faults  shall  be  detectable  by  the 


,  equipment's  BIT  capability. 

2.  A  specified  percentage  of  the  repair  times  required  to  perform  secondary 
isolation  must  not  exceed  a  certain  value. 

3.  The  average  number  of  maintenance  manhours  required  to  repair  the  equip¬ 
ment  must  not  exceed  a  given  value. 


BIT  DETECTION  PROBABILITY 


Consider  an  equipment  consisting  of  N  LRUs  and  a  BIT  diagnostic  subsystem. 
The  probability  that  the  BIT  subsystem  will  detect  a  faulty  LRU  is 


(1-P~)  =  I  P(LRUi=bad)P(BIT  detects  fault  in  LRUil  LRUi=bad) 
r  i=l 


P(B1T  detects  fault  in  LRUiJ  LRUi=bad)  (25) 


Consider  the  four-element  sample  problem  defined  in  Tables  1  and  2. 

Suppose  that  a  particular  BIT  testing  sequence,  {Tq001,  Tq1()1,  and  TQ0 
which  is  diagrammed  in  Figure  4,  is  implemented.  To  compute  the  BIT 
detection  probability  for  this  testing  sequence,  the  probabilities  that  the 
BIT  subsystem  detects  a  fault  in  each  LRU  when  that  LRU  it  known  to  be  bad 
must  be  computed. 

P(BIT  detects  fault  in  LRU4|LRU4  =  bad)  (26) 

=  HTqqqi  does  not  detect  fault  in  LRU1,  2,  or  3|LRU1,  2,  3  =  good) 

=  p(Toooi  Passi111;L>  LRU1 ,2,3  =  good)  =  1 

P(BIT  detects  fault  in  LRU2|LRU2  =  bad) 

=  p(TgQgi  detects  fault  in  LRU1,2,  or  3|LRU2  =  bad) 

P(Tqioi  does  not  detect  fault  in  LRU1  or  3|LRU1,3  =  good) 

=  p(T0001  fail j 1111,  LRU 2  =  bad)P(T01Q1  pass 1 1110,  LRU1.3  =  good)  (27) 
=  0.85(1)  =  0.85 

P(BIT  detects  fault  in  LRU3|LRU3  =  bad) 

=  p(T0001  detects  fault  in  LRU1,2,  or  3|LRU3  =  bad) 

P(Tqioi  detects  fault  in  LRU1  or  3|LRU3  =  bad) 

P(Tqoio  does  not  detect  fault  in  LRUl|LRUl  =  good) 


p(TOioi  detects  fault  in  LRU1  or  3|LRU1  =  bad) 


P(T00l0  detects  fault  in  LRU1lLRlJl  -  bad) 


-  P(TOooi  fail|llll,  LRU1  =  bad)  P(Tmm  fail|lllO,  LRU1  =  bad) 


0101 


P(T0010  faill10;L0>  LRU1  =  bad> 


(25) 


0.7(0.87)0.9  =  0.548 


The  BIT  detection  probability  for  the  testing  sequence  ^Tqqq^»  ^q101 


tooio}  is 


(1-Pp)  =  Pl(0.548)  +  p2(0.85)  +  p3(0.468)  +  pA(l) 

=  0.2(0.548)  +  0.3(0.85)  +  0.4(0.468)  +  0.1(1)  =  0.652  (30) 


Conversely,  the  probability  that  the  BIT  subsystem  will  not  detect  the 


group  of  LRUs  containing  the  faulty  unit  is  P_.  =  0.348.  In  the  latter 

Jc 


event  it  will  be  necessary  to  manually  troubleshoot  the  entire  equipment 
to  isolate  the  faulty  LRU. 


v  v »  Wv  VVv'-.-V/ •  vv.vv.v-,-  ■ 


SPECIFIED  PERCENTILE  FOR  THE  DISTRIBUTION  OF 
MAINTENANCE  MANHOURS  REQUIRED  FOR  SECONDARY  ISOLATION 


Assume  the  probability  distribution  of  the  maintenance  manhours 
required  for  secondary  isolation  is  known  for  ail  of  the  mutually  ex¬ 
clusive  groups  of  LRUs  into  which  the  equipment  has  been  partitioned  by 
its  BIT  diagnostic  subsystem.  An  percentile  of  the  probability  distri¬ 
bution  of  the  random  variable  MG^ ,  representing  the  repair  times  required 
for  the  K  mutually  exclusive  groups  of  LRUs,  is  a  particular  number  of 

maintenance  manhours,  denoted  by  M  ,  such  that  m  percent  of  the  time  the 

max 

equipment  can  be  repaired  in  less  than  M  hours.  The  following  two 

max 

inequalities  must  be  satisfied: 


P(MG.  <  M  )  >  m  l(f 
j  —  max  — 


P (MG .  <  M  )  <  m  10" 
j  max  — 


To  determine  the  ra  percentile  of  the  distribution  o.  MG^,  first 

rank  the  values  of  MG^  in  ascending  order,  for  j  =  1,  2,  . ,.,K.  Next, 

sum  the  probabilities  of  the  ordered  values  of  MG.  until  M  is  identified. 

j  max 

Assume  that  the  sample  problem  defined  in  Tables  1  and  2  has  the  BIT 
testing  sequence  ^tqqq2»  Toi01’  ^0010^  *s  diagrammed  in  Figure  4. 

This  BIT  diagnostic  subsystem  isolates  to  the  single  failed  LRU.  The 
distribution  of  maintenance  times  required  for  secondary  isolation  can  be 
obtained  from  Figure  4,  which  contains  the  probabilities  that  each  of  the 
four  terminal  states  will  be  called  out  by  the  BIT  diagnostic,  and  from 
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Table  2  which  gives  the  time  required  to  perform  secondary  isolation  on 
each  of  these  terminal  states.  The  distribution  of  these  maintenance  times 
is  displayed  in  Table  8. 

Table  8 

Dis  ribution  of  Repa.-  .  mes  for  Secondary  Isolation 

Group  j  1234 


MG^ (hrs) 


0.3  0.7  0.5  0.4 

0.039  0.309  0.178  0.474 


Suppose  that  65%  of  the  time  the  equipment  must  be  repaired  in  less  than 
1.3  hours.  To  determine  whether  the  indicated  BIT  testing  sequence 
satisfies  this  specification,  the  65^  percentile  of  the  distribution  of 
repair  times  will  be  computed.  In  Table  9  the  repair  times  are  ranked  in 
ascending  order. 

Table  9 

Repair  Times  Ranked  in  Ascending  Order 


Group  j 

1 

4 

3 

2 

MG., (hrs) 

0.3 

0.4 

0.5 

0.7 

P. 

J 

0.039 

0.474 

0.178 

0.309 

Summing  the  probabilities  of  the  ranked  repair  times  gives  the  following 
results: 

P(MG.  <  1.2  hr)  »  P(MG  =0.3  hr)  +  P(MG  =  0.4  hr)  +  P(MG  =  0.5  hr) 

3  J  j  J 

=  0.039  +  0.474  +  0.178 

=  0.691^0.65  (33) 

P(MG.  <  1.2  hr)  =  P(MG .  =  0.3  hr)  +  P(MG.  =  0.'.  hr) 

J  3  J 

=  0.035  +  0.474 

=  0.513^  0.65  (34) 

Therefore,  the  65^  percentile  is  =  1.2  hr,  and  the  given  BIT  testing 

sequence  does  meet  the  specification. 

MEAN  TIME  TO  REPAIR  THE  EQUIPMENT 

If  the  automatic  BIT  diagnostic  subsystem  is  capable  of  detecting  the 
group  of  LRUs  which  contains  the  faulty  unit,  then  secondary  isolation  must 
be  performed  to  locate  the  failed  LRU.  The  mean  time  to  complete  secondary 
isolation  is 

K 

ME  =  E [MG . ]  =  I  P .  MG. 

J  j=l  3  J 

In  the  event  that  the  automatic  BIT  subsystem  does  not  identify  the  group 
of  LRUs  that  contains  the  faulty  unit,  manual  troubleshooting  of  the  entire 


equipment  is  required.  The  probability  is  P  that  manual  troubleshooting 

¥ 

will  be  required.  The  mean  time  to  repair  the  equipment  is: 


M  =  ME(1-Pf)  +  MF(Pp) 

K 

=  (1-P  )  l  P  MG.  +  (P  )MF 

r  .  -  j  j  r 

3=1  J  J 


(35) 


BIT  FALSE  ALARMS 

If  the  BIT  subsystem  can  generate  false  alarms  by  indicating  an 
error  in  the  equipment  when  no  error  is  present,  the  previous  analysis 
remains  valid  provided  that  the  probabilities  that  the  individual  BITs 
will  pass  or  fail  are  recomputed.  To  illustrate  the  calculation  of  the 
probabilities  that  a  single  BIT  will  pass  or  fail  when  false  alarms  are 
possible,  consider  the  following  example  presented  in  Table  10  of  a  single 
BIT,  Tqqh*  which  can  test  any  combination  of  LRUs  1  and  2. 

Taole  10 

A  BIT  which  can  Generate  False  Alarms 

Test  LRUk  p.  P(Tnri1 , correctly  detects  fault!  P(T  incorrectly  detect 
k  0011  LRUk  bad)  0011  fault | LRUk  good) 


0011 


1 


0.2 


0.82 


0.06 


•  f 


t 

i 

i 

Assume  that  LRUs  1  and  2  are  to  be  tested. 

P(T0011  falllLRU3-.  2  tested) 

=  P(LRU1  bad)  p(T0011  correctly  detects  fault |LRU1  bad) 

+  P(LRU2  bad)  p(T0Q11  correctly  detects  fault |lRU2  bad) 

+  P(LRU1  good)  p(T0011  incorrectly  detects  fault|LRUl  good) 

+  P(LRU2  good)  p(T0011  incorrectly  detects  fault [LRU2  good) 

-  P(LRU1  bad)  p(T0011  correctly  detects  fault |LRU1  bad) 

j 

!  P(LRU2  bad)  p(T0011  correctly  detects  fault |LRU2  bad) 

-  P(LRU1  bad)  p(T0011  correctly  detects  fault |LRU1  bad) 

P(LRU2  good)  p(T0011  incorrectly  detects  fault |LRU2  good) 

-  P(LRU1  good)  p(T0011  incorrectly  detects  fault |LRU1  good) 

P(LRU2  bad)  p(p0011  correctly  detects  fault |LRU2  bad) 

-  P(LRU1  good)  p(T0011  incorrectly  detects  fault |LRU1  good) 

P(LRU2  good)  p(TQ011  incorrectly  detects  fault |LRU2  good)  (36) 

=  0.2(0.82)  +  0.3(0.93)  +  0.8(0.06)  +  0.7(0.04)  -  0.2(0.82)0.3(0.93) 

-  0.2(0.82)0.7(0.04)  -  0.8(0.06)0.3(0.93)  -  0.8(0.06)0.7(0.04)  =  0.4! 

p(TqOii  passjLRUl,2  tested) 

=  P(LRU1  good)  p(T0011  not  detect  fault |LRU1  good) 

P(LRU2  good)  p(TQ011  not  detect  fault |LRU2  good) 

+  P(LRU1  bad)  p(p0011  not  detect  fault|LRUl  bad) 

P(LRU2  good)  p(TQ011  not  detect  fault |LRU2  good) 

+  P(LRU1  good)  p(TQ011  not  detect  fault |LRU1  good) 

P(LRU2  bad)  p(To011  not  detect  fault |LRU2  bad) 

+  P(LRU1  bad)  p(Tq011  not  detect  fault |LRU1  bad) 

P(LRU  2  bad)  p(Tqq^  not  detect  fault |LRU2  bad)  (37) 


>  /  .*  .-  .*  V  .*  .-  *  j*  ■_-  •/  \*  -  ^  ^ 


=  0.8(0.94)0.7(0.96)  +  0.2(0.18)0.7(0.96)  +  0.8(0.94)0.3(0.07) 
+  0.2(0.18)0.3(0.07)  =  0.546 
“  1  -  P(Toon  fall | LRU  1 ,2  tested) 


Once  the  probabilities  of  BIT  false  alarms  have  been  included  in  the 
probabilities  that  each  BIT  will  pass  or  fail,  the  recursive  algorithm  can 
be  applied  to  derive  a  minimum  cost  BIT  testing  sequence. 


SUMMARY 


.A  recursive  algorithm  has  been  presented  which  will  generate  a  minimum 
cost  sequence  of  BITs.  The  BITs  are  imperfect  tests  in  the  sense  that  they 
may  not  detect  all  of  the  possible  errors  in  the  equipment,  and  they  may  gen¬ 
erate  false  alaims  by  indicating  an  error  when  no  error  is  present. 

The  proportion  of  faults  detectable  by  each  BIT  is  assumed  to  be  known. 
The  probabilities  of  false  alarms  produced  by  the  BITs  ar=>  a1  so  known.  In  ad¬ 
dition,  the  conditional  probabilities  of  failure  of  the  LRUr ,  given  that  the 
equipment  has  malfunctioned,  are  known.  Both  the  cost  of  a  BIT  and  the  prob¬ 
ability  that  a  BIT  will  pass  or  fail  are  functions  of  which  LRUs  remain  to 
be  tested.  The  cost  of  a  BIT  which  tests  a  group  of  LRUs  is  the  s  i  of  the 
costs  of  implementing  the  BIT  for  each  of  the  LRUs  under  test.  The  condi¬ 
tional  probability  that  a  BIT  will  pass  or  fail,  given  that  certain  LRUs  are 
tested,  is  computed  in  terms  of  the  conditional  probabilities  of  failure  of 
the  LRUs,  the  probability  that  the  BIT  will  detect  an  error  in  a  faulty  LRU, 
and  the  probability  that  the  BIT  will  generate  a  false  alarm.  The  life  cycle 
cost  of  a  BIT  testing  sequence  is  equal  to  the  sura  of  the  costs  of  the  BITs 
plus  the  expected  costs  of  secondary  isolation  for  the  mutually  exclusive 
groups  of  LRUs  that  can  be  called  out  by  the  BIT  diagnostic  subsystem. 


The  recursive  algorithm  consists  of  a  backward  computational  process 
followed  by  a  forward  computational  process.  During  the  backward  process 
expected  cost  functions  are  computed  recursively  as  linear  functions  of  un¬ 
known  state  probabilities.  (A  state  distinguishes  untested  LRUs  from  those 
which  are  known  to  be  good.)  During  the  forward  process  the  state  probabil¬ 
ities  are  evaluated  and  a  sequence  of  BITs  with  a  minimum  life  cycle  cost 
is  determined.  The  recursive  algorithm  is  applied  to  a  four-element  sample 
problem  to  produce  a  numerical  solution. 

The  following  three  constraints  are  described.  1.  The  BIT  subsystem 
must.be  capable  of  detecting  a  minimum  proportion  of  equipment  faults.  2. 

A  specified  percentage  of  the  repair  times  required  to  perform  secondary  .. 
isolation  must  not  exceed  a  given  value.  3.  The  mean  time  to  repair  the 
equipment  must  not  exceed  a  specified  value. 


*.  *.  \  *. 
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